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Abstract 



cover syntactic ambiguities (cf. (Pinkal 95; EggLe- 



beth 95; Schichlen 96)). Schiehlen's approach is out- 



We investigate the problem of determin- 
ing a compact underspecified semantical 
representation for sentences that may be 
highly ambiguous. Due to combinatorial 
explosion, the naive method of building se- 
mantics for the different syntactic readings 
independently is prohibitive. We present 
a method that takes as input a syntac- 
tic parse forest with associated constraint - 



based semantic construction rules and di- 
rectly builds a packed semantic structure. 
The algorithm is fully implemented and 
runs in 0(n 4 log(n)) in sentence length, if 
the grammar meets some reasonable 'nor- 
mality' restrictions. 



1 Background 

One of the most central problems that any NL sys- 
tem must face is the ubiquitous phenomenon of am- 
biguity. In the last few years a whole new branch de- 
veloped in semantics that investigates underspecified 
semantic representations in order to cope with this 
phenomenon. Such representations do not stand for 
the real or intended meaning of sentences, but rather 
for the possible options of interpretation. Quanti- 
fier scope ambiguities are a semantic variety of am- 
biguity that is handled especially well by this ap- 
proach. Pioneering work in that direction has been 
( [Alshawi 92] ) and QReyle 93p . 

More recently there has been growing interest in 
developing the underspecification approach to also 
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standing in that he fully takes into account syntactic 
constraints. In ( [Schichlcn 96 ) he presents an algo- 
rithm which directly constructs a single underspec- 
ified semantic structure from the ideal "underspeci- 
fied" syntactic structure, a parse forest. 

On the other hand, a method for producing 
"packed semantic structures" , in that case "packed 
quasi-logical forms" , has already been used in the 



Core Language Engine, informally described in (Al- 



shawi 92, Chap. 7). However, this method only pro- 
duces a structure that is virtually isomorphic to 
the parse forest, since it simply replaces parse for- 
est nodes by their corresponding semantic oper- 
ators. No attempt is made to actually apply se- 
mantic operators in the phase where those "packed 
QLFs" are constructed. Moreover, the packing of 
the QLFs seems to serve no purpose in the process- 
ing phases following semantic analysis. Already the 
immediately succeeding phase "sortal filtering" re- 
quires QLFs to be unpacked, i.e. enumerated. 

Contrary to the CLE method, Schiehlen's method 
actively packs semantic structures, even when they 
result from distinct syntactic structures, extracting 
common parts. His method, however, may take time 
exponential w.r.t. sentence length. Already the se- 
mantic representations it produces can be exponen- 
tially large, because they grow linear with the num- 
ber of (syntactic) readings and that can be exponen- 
tial, e.g., for sentences that exhibit the well-known 
attachment ambiguity of prepositional phrases. It is 
therefore an interesting question to ask, whether we 
can compute compact semantic representations from 
parse forests without falling prey to exponential ex- 
plosion. 

The purpose of the present paper is to show that 
construction of compact semantic representations 
like in Schiehlen's approach from parse forests is not 
only possible, but also cheap, i.e., can be done in 



start (DRS) — > S ([_,_, ltop] ,[], DRS) . 

s ( [Event, VerbL, DomL] , DRS_i, DRS_o) — > 
np ( [X, VerbL, DomL] , DRS_i , DRS1 ) , 
vp ( [Event, X, VerbL, DomL] , DRS1, DRS_o) . 

s ( [Event, VerbL, DomL] , DRS_i, DRS_o) — > 
s ( [Event, VerbL, DomL] , DRS_i, DRS1) , 
pp ( [Event, VerbL, DomL] , DRS1, DRS_o) . 

vp ( [Ev,X, VerbL, DomL] , DRS_i, DRS_o) — > 
vt ( [Ev,X,Y, VerbL, DomL] ,DRS_i,DRSl) , 
np ( [Y, VerbL, DomL] ,DRSl,DRS_o) . 

np ( [X,VbL,DomL] , DRS_i, DRS_o) — > 

det ( [X,NounL,VbL,DomL] ,DRS_i,DRSl) , 
n ( [X, NounL, DomL] , DRS1, DRS_o) . 

n ( [X, NounL, DomL] , DRS_i, DRS_o) — > 
n ( [X, NounL, DomL] , DRS_i , DRS1 ) , 
pp ( [X, NounL, DomL] , DRS1, DRS_o) . 

pp ( [X, L, DomL] , DRS_i, DRS_0) — > 
prep (Cond, X, Y ) , 

np ( [ Y, L, DomL] , [L:Cond|DRS_i] ,DRS_0) . 



vt ( [Ev, X, Y, L,_DomL] , DRS_i, DRS) — > [saw], 
{DRS= [L: see (Ev, X, Y) | DRS_i] } . 

det { [X, Lab, VerbL, _] , DRS_i, DRS) — > 
[a] , 

{DRS= [It (Lab, ltop) , It (VerbL, Lab) , 
Lab:X|DRS_i] , 
gensym (1, Lab) , gensym (x, X) } . 

det ( [X, ResL, VbL, DomL] ,DRS_i,DRS) — > 
[every] , 

{DRS= [It (L, DomL) , It (VbL, ScpL) , ResL:X, 
L : every (ResL, ScpL) |DRS_i], 
gensym (1, L) , gensym (1, ResL) , 
gensym (1, ScpL) , gensym (x, X) } . 

np ( [X,_,_] , DRS_i, DRS) — > [i], 

{DRS= [ltop : X, anchor (X, speaker) | DRS_i] , 
gensym (x, X) } . 

n( [X,L,_] ,DRS, [L:man(X) |DRS] ) — > [man], 
n ( [X, L,_] , DRS, [L:hill (X) | DRS] ) — > [hill]. 

prep (on (X, Y) , X, Y) — > [on], 
prep (with (X, Y) , X, Y) — > [with]. 



Figure 1: Example DCG 



polynomial time. 

To illustrate our method we use a simple DCG 
grammar for PP-attachment ambiguities, adapted 



from (3chichlcn 96), that yields semantic represen- 
tations (called UDRSs) according to the Underspec- 
ified Discourse Representation Theory ( Reyle 95 ; 
KampReyle 93 ). The grammar is shown in Fig. 

The UDRSs constructed by the grammar are flat 
lists of the UDRS-constraints I < I' (subordination 
(partial) ordering between labels; Prolog represen- 
tation: It (/,/')), I '■ Cond (condition introduction 
in subUDRS labeled l) 1 I : X (referent introduc- 
tion in i), Z : GenQuant(l' ' ,1") (generalised quan- 
tifier) and an anchoring function. The meaning of a 
UDRS as a set of denoted DRSs can be explained 
as follows.]] All conditions with the same label form 
a subUDRS and labels occurring in subUDRSs de- 
note locations (holes) where other subUDRSs can 
be plugged into. The whole UDRS denotes the set 
of well-formed DRSs that can be formed by some 
plugging of the subUDRSs that does not violate the 
ordering <. Scope of quantifiers can be underspec- 
ified in UDRSs, because subordination can be left 
partial. 

In our example grammar every nonterminal has 
three arguments. The 2nd and the 3rd argument rep- 
resent a UDRS list as a difference list, i.e., the UDRS 



1 Readers unfamiliar with DRT should think of these 
structures as some Prolog terms, representing seman- 
tics, built by unifications according to the semantic rules. 
It is only important to notice how we extract common 
parts of those structures, irrespective of the structures' 
meanings. 



is "threaded through" . The first argument is a list of 
objects occurring in the UDRS that play a specific 
role in syntactic combinations of the current node.0 
An example of a UDRS, however a packed UDRS, is 
shown later on in 

To avoid the dependence on a particular grammar 
formalism we present our method for a constraint- 
based grammar abstractly from the actual constraint 
system employed. We only require that semantic 
rules relate the semantic 'objects' or structures that 
are associated with the nodes of a local tree by em- 
ploying constraints. E.g., we can view the DCG rule 
s — > np vp as a relation between three 'seman- 
tic construction terms' or variables SemS, SemNP, 
SemVP equivalent to the constraints 
SemS = [ [Event, VerbL, DomL , TopL] ,DRS_i,DRS_o] 
SemNP = [[X, VerbL, DomL, TopL] ,DRS_i,DRSl] 
SemVP = [ [Event , X , VerbL , DomL , TopL] , DRS 1 , DRS_o] 

Here is an overview of the paper. gives the pre- 
liminaries and assumptions needed to precisely state 
the problem we want to solve. ^ presents the ab- 
stract algorithm. Complexity considerations follow 
in §|]. Finally, we consider implementation issues, 
present results of an experiment in §||, and close with 
a discussion. 

2 The Problem 



As mentioned already, we aim at calculating from 
given parse forests the same compact semantic struc- 
tures that have been proposed by ( gchichlcn 96 ), 
i.e. structures that make explicit the common parts 
of different syntactic readings, so that subsequent 



2 E.g., for an NP its referent, as well as the upper and 
lower label for the current clause and the top label. 



semantic processes can use this generalised infor- 
mation. As he does, we assume a constraint-based 



grammar, e.g. a DCG ( PereiraWarren 80) or HPSG 
( PollardSag 94 ) , in which syntactic constraints and 
constraints that determine a resulting semantic rep- 
resentation can be seperated and parsing can be per- 
formed using the syntactic constraints only. 

Second, we assume that the set of syntax trees can 



be compactly represented as a parse forest (cf. (Ear 
Ley 70; BillotLang 8G; Tomita 86| )). Parse forests are 



rooted labeled directed acyclic graphs with AND- 
nodes (standing for context-free branching) and OR- 
nodes (standing for alternative subtrees), that can 
be characterised as follows (cf. Fig. || for an exam- 
ple)J3 

1. The terminal yield as well as the label of two 
AND-nodes are identical, if and only if they 
both are children of one OR-node. 

2. Every tree reading is a valid parse tree. 

Tree readings of such graphs are obtained by replac- 
ing any OR-node by one of its children. Parse forests 
can represent an exponential number of phrase 
structure alternatives in o(n 3 ) space, where n is the 
length of the sentence. The example uses the 3 OR- 
nodes (A, B, C) and the AND-nodes 1 through 32 
to represent 5 complete parse trees, that would use 
5 x 19 nodes. 

Third, we assume the rule-to-rule hypothesis, i.e., 
that the grammar associates with each local tree a 
'semantic rule' that specifies how to construct the 
mother node's semantics from those of its children. 

Hence, input to the algorithm is 

• a parse forest 

• an associated semantic rule for every local tree 
(AND-node together with its children) therein 

• and a semantic representation for each leaf 
(coming from a semantic lexicon). 

To be more precise, we assume a constraint lan- 
guage C over a denumerable set of variables X, 
that is a sublanguage of Predicate Logic with equal- 
ity and is closed under conjunction, disjunction, 
and variable renaming. Small greek letters <f>, ip will 
henceforth denote constraints (open formulae) and 
letters X, Y, Z (possibly with indeces) will denote 
variables. Writing 4>(Xi, . . . ,Xk) shall indicate that 
X\ , . . . , Xk are the free variables in the constraint (j). 



The graphical representation of an OR-node is a box 
surrounding its children, i.e. the AND-OR-graph struc- 



ture of 




Frequently used examples for constraint languages 
are the language of equations over first-order terms 
for DCGs,^] PATR-style feature-path equations, or 
typed feature structure description languages (like 
the constraint languages of ALE ( parpenter 92 ) or 
CUF (DorreDorna 93)) for HPSG-style grammars. 

Together with the constraint language we require 
a constraint solver, that checks constraints for satis- 
fiability, usually by transforming them into a normal 
form (also called 'solved form'). Constraint solving 
in the DCG case is simply unification of terms. 

The semantic representations mentioned before 
are actually not given directly, but rather as a con- 
straint on some variable, thus allowing for partiality 
in the structural description. To that end we assume 
that every node in the parse forest v has associated 
with it a variable X v that is used for constraining the 
(partial) semantic structure of v. The semantics of 
a leaf node [i is hence given as a constraint 4>^{X^), 
called a leaf constraint. 

A final assumption that we adopt concerns the na- 
ture of the 'semantic rules'. The process of semantics 
construction shall be a completely monotonous pro- 
cess of gathering constraints that never leads to fail- 
ure. We assume that any associated (instantiated) 
semantic rule r(y) of a local tree (AND-branching) 
v{y\, fk) determines z/'s semantics £(f) as fol- 
lows from those of its children: 

E(i/) = 3X U1 . . . 3X Vk (<j> r ( v ) (X v , X Vl , . . - ,X Uk ) A 
E(^) A . . . A E(^)). 

The constraint (f> r uA {X v , X V1 , . . . , X Vk ) is called the 
rule constraint for v. It is required to only depend 



on the variables X v , X v 



. , X Vk . Note that if the 



same rule is to be applied at another node, we have 
a different rule constraint. 

Note that any E(z/) depends only on X v and can 
be thought of as a unary predicate. Now, let us con- 
sider semantics construction for a single parse tree 
for the moment. The leaf constraints together with 
the rules define a semantics constraint E(f) for ev- 
ery node v, and the semantics of the full sentence 
is described by the E-constraint of the root node, 
T,(root). In the E-constraints, we actually can sup- 
press the existential quantifiers by adopting the con- 
vention that any variable other than the one of the 
current node is implicitly existentially bound on the 
formula toplevel. Name conflicts, that would force 
variable renaming, cannot occur. Therefore E(rooi) 
is (equivalent to) just a big conjunction of all rule 



4 DCG shall refer in this paper to a logically pure ver- 
sion, Definite Clause Grammars based on pure Prolog, 
involving no nonlogical devices like Cut, var/1, etc. 
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Figure 2: Example 

constraints for the inner nodes and all leaf con- 
straints. 

Moving to parse forests, the semantics of an OR- 
node v(vi, ■ ■ . , Vk) is to be defined as 

E(i/) = 3X^ . . . 3X„ k (E(^) A X„=X V , V ... 

yn{u k )AX v =x Vk ), 

specifying that the set of possible (partial) semantic 
representations for v is the union of those of i/'s chil- 
dren. However, we can simplify this formula once and 
for all by assuming that for every OR- node there is 
only one variable X v that is associated with it and all 
of its children. Using the same variable for v\ . . . Vk 
is unproblematic, because no two of these nodes can 
ever occur in a tree reading. Hence, the definition we 
get is 

E(i/)=%)V...VE(4 

Now, in the same way as in the single-tree case, we 
can directly "read off" the E-constraint for the whole 



d n p d n 

28 29 30 31 32 

the hill with the tele 

of a parse forest 

parse forest representing the semantics of all read- 
ings. Although this constraint is only half the way 
to the packed semantic representation we are aim- 
ing at, it is nevertheless worthwhile to consider its 
structure a little more closely. Fig. || shows the struc- 
ture of the E-constraint for the OR-node B in the 
example parse forest. 

In a way the structure of this constraint directly 
mirrors the structure of the parse forest. However, 
by writing out the constraint, we loose the sharings 
present in the forest. A subformula coming from a 
shared subtree (as £(18) in Fig. ||) has to be stated 
as many times as the subtree appears in an unfolding 
of the forest graph. In our PP-attachment example 
the blowup caused by this is in fact exponential. 

On the other hand, looking at a E-constraint as a 
piece of syntax, we can represent this piece of syntax 
in the same manner in which trees are represented in 
the parse forest, i.e. we can have a representation of 
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Figure 3: Constraint E(i3) of example parse forest 



E(rooi) with a structure isomorphic to the forest's 
graph structure.^ In practice this difference becomes 
a question of whether we have full control over the 
representations the constraint solver employs (or any 
other process that receives this constraint as input). 
If not, we cannot contend ourselves with the possi- 
bility of compact representation of constraints, but 
rather need a means to enforce this compactness on 
the constraint level. This means that we have to in- 
troduce some form of functional abstraction into the 
constraint language (or anything equivalent that al- 
lows giving names to complex constraints and refer- 
encing to them via their names). Therefore we en- 
hance the constraint language as follows. We allow 
to our disposition a second set of variables, called 
names, and two special forms of constraints 

1. def (<name> , <constraint>) 

name definition 

2. <name> name use 

with the requirements, that a name may only be 
used, if it is defined and that its definition is unique. 
Thus, the constraint E(i3) above can be written as 

(0 r(6) A ... A 2 6 A N 
V r(7) A ... A 26 A N ) 
A def(N, r (i 8 ) A 027 A 0,. ( 2i) A 028 A 29 ) 

The packed semantic representation as con- 
structed by the method described so far still calls 
for an obvious improvement. Very often the dif- 
ferent branches of disjunctions contain constraints 
that have large parts in common. However, although 
these overlaps are efficiently handled on the rep- 
resentational level, they are invisible at the logical 
level. Hence, what we need is an algorithm that fac- 
tores out common parts of the constraints on the 
logical level, pushing disjunctions down.[] There are 
two routes that we can take to do this efficiently. 



In the first we consider only the structure of the 
parse forest, however ignore the content of (rule or 
leaf) constraints. I.e. we explore the fact that the 
parts of the E-constraints in a disjunction that stem 
from nodes shared by all disjuncts must be identical, 
and hence can be factored outj^j More precisely, we 
can compute for every node v the set must-occur(^) 
of nodes (transitively) dominated by v that must oc- 
cur in a tree of the forest, whenever v occurs. We can 
then use this information, when building the disjunc- 
tion Ti(y) to factor out the constraints introduced 
by nodes in must-occur (is), i.e., we build the fac- 
tor $ = A t / e must-occurM and a 'remainder' 
constraint E(z/^)\$ for each disjunct. 

The other route goes one step further and takes 
into account the content of rule and leaf constraints. 
For it we need an operation generalise that can be 
characterised informally as follows. 

For two satisfiable constraints and ip, 
generalise(0, tp) yields the triple £, 0', ip' , 
such that £ contains the 'common part' of 
and ip and 0' represents the 'remainder' 
0\£ and likewise ip' represents ip\£. 

The exact definition of what the 'common part' or 
the 'remainder' shall be, naturally depends on the 
actual constraint system chosen. For our purpose it 
is sufficient to require the following properties: 

If generalise(0, tp) i— > (£, 0', -0'), then h 
£ and tp h £ and = £ A 0' and -0 = £ A ip'. 



5 The p acked QLFs in the Core Language Engine ( Al- 
shawi 92) are an example of such a representation. 



3 Actually, in the £(£>) example such a factoring 



makes the use of the name N superfluous. In general, 
however, use of names is actually necessary to avoid ex- 
ponentially large constraints. Subtrees may be shared 
by quite different parts of the structure, not only by dis- 
juncts of the same disjunction. In the PP-attachment ex- 
ample, a compression of the E-constraint to polynomial 
size cannot be achieved with factoring alone. 

7 (Maxwell IIIKaplan 93) exploit the same idea for 
efficiently solving the functional constraints that an LFG 
grammar associates with a parse forest. 



We shall call such a generalisation operation sim- 
plifying if the normal form of £ is not larger than 
any of the input constraints' normal form. 

Example: An example for such a generalisation 
operation for Prolog's constraint system (equa- 
tions over first-order terms) is the so-called anti- 
unify operation, the dual of unification, that some 
Prolog implementations provide as a library pred- 
icate.0 Two terms Tl and T2 'anti-unify' to T, iff 
T is the (unique) most specific term that subsumes 
both Tl and T2. The 'remainder constraints' in this 
case are the residual substitutions o~\ and o-i that 
transform T into Tl or T2, respectively. 

Let us now state the method informally. We use 
generalise to factor out the common parts of dis- 
junctions. This is, however, not as trivial as it might 
appear at first sight. Generalise should operate 
on solved forms, but when we try to eliminate the 
names introduced for subtree constraints in order 
to solve the corresponding constraints, we end up 
with constraints that are exponential in size. In the 
following section we describe an algorithm that cir- 
cumvents this problem. 

3 The Algorithm 

We call an order < on the nodes of a directed 
acyclic graph G = (N, E) with nodes N and edges E 
bottom-up, iff whenever (i, j) G E ("i is a predecessor 
to j"), then j < i. 

For the sake of simplicity let us assume that 
any nonterminal node in the parse forest is binary 
branching. Furthermore, we leave implicit, when 
conjunctions of constraints are normalised by the 
constraint solver. Recall that for the generalisation 
operation it is usually meaningful to operate on 
solved forms. However, at least the simplifications 
true A 4> = (f> and <fi A true = should be assumed. 

The Packed Semantics Construction Algorithm is 
given in Fig. ^. It enforces the following invariants, 
which can easily be shown by induction. 

1. Every name used has a unique definition. 

2. For any node v we have the equivalence £(z/) = 
SEM[z/] A [D[i/]], where [D[j/]] shall denote the 
constraint obtained from D [v\ when recursively 
replacing names by the constraints they are 
bound to in ENV. 

3. For any node v the constraint SEM[^] is never 
larger than the E-constraint of any single tree 
in the forest originating in v. 



8 anti_unify in Quiritus Prolog 
Sicstus Prolog. 



term_subsumer in 



Hence, the returned constraint correctly represents 
the semantic representation for all readings. 

4 Complexity 

The complexity of this abstract algorithm depends 
primarily on the actual constraint system and gen- 
eralisation operation employed. But note also that 
the influence of the actual semantic operations pre- 
scribed by the grammar can be vast, even for the 
simplest constraint systems. E.g., we can write a 
DCGs that produce abnormal large "semantic struc- 
tures" of sizes growing exponentially with sentence 
length (for a single reading). For meaningful gram- 
mars we expect this size function to be linear. There- 
fore, let us abstract away this size by employing 
a function /g(ti) that bounds the size of semantic 
structures (respectively the size of its describing con- 
straint system in normal form) that grammar G as- 
signs to sentences of length n. 

Finally, we want to assume that generalisation is 
simplifying and can be performed within a bound of 
g(m) steps, where m is the total size of the input 
constraint systems. 

With these assumptions in place, the time com- 
plexity for the algorithm can be estimated to be (n 
= sentence length, TV = number of forest nodes) 

0(g(f G (n)) ■ N) < 0(g(f G (n)) ■ n 3 ), 

since every program step other than the generali- 
sation operation can be done in constant time per 
node. Observe that because of Invariant 3. the input 
constraints to generalise are bounded by fa as any 
constraint in SEM. 

In the case of a DCG the generalisation oper- 
ation is anti_unify, which can be performed in 
o(n ■ login)) time and space (for acyclic struc- 
tures). Hence, together with the assumption that 
the semantic structures the DCG computes can 
be bounded linearly in sentence length (and are 
acyclic), we obtain a 0(n ■ log(n) ■ N) < 0(n 4 log(n)) 
total time complexity. 

5 Implementation and Experimental 
Results 

The algorithm has been implemented for the PRO- 
LOG (or DCG) constraint system, i.e., constraints 
are equations over first-order terms. Two implemen- 
tations have been do ne. One in the conc urrent con- 
straint language OZ (ISmolkaTrcincn 96|) and one in 
Sicstus Prolog]^ The following results relate to the 

9 The OZ implementation has the advantage that fea- 
ture structure constraint solving is built-in. Our imple- 



Input: • parse forest, leaf and rule constraints as described above 

• array of variables X v indexed by node s.t. if v is a child of OR-node i/', then X u = X u i 

Data structures: • an array SEM of constraints and an array D of names, both indexed by node 
• a stack ENV of def constraints 

Output: a constraint representing a packed semantic representation 

Method: ENV := nil 

process nodes in a bottom-up order 
doing with node v. 
if v is a leaf then 

SEM[i/] := <j> v 

D[f] := true 
elseif v is AND(^i, v 2 ) then 

SEM[i/] := </v H A SEM[z/ x ] A SEM[zaj] 

if D[i/i] = true then D[v] := D[u 2 ] 

elseif D[^2] = true then T)\v\ := T>\v\\ 

else T)\v\ :— newname 

push def(D[i/], D[^i] A D[v> 2 }) onto ENV 

end 

elseif v is OR(^i, v 2 ) then 

let GEN, REM1, REM2 such that 

generalise(SEM[i/i],SEM[zA,]) ^ (GEN, REM1, REM2) 
SEM[u] := GEN 
D[^] := newname 

push def(D[i/], REM1 A D[ui] V REM2 A D[v 2 \) onto ENV 
end return SEM[root] A B[root] A ENV 



Figure 4: Packed Semantics Construction Algorithm 



Prolog implementation]^ 

Fig. |5| shows the resulting packed UDRS for the 
example forest in Fig. 0. Fig. ^| displays the SEM 
part as a graph. The disjunctive binding environ- 
ment only encodes what the variable referents B and 
D (in conjunction with the corresponding labels A 
and C) may be bound to to: one of el, x2, or x3 (and 
likewise the corresponding label) . Executing the goal 
dEnv(509,l, [B,A,D,C]) yields the five solutions: 
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= x3 
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mentation actually represents the DCG terms as a fea- 
ture structures. Unfortunately it is an order of magni- 
tude slower than the Prolog version. The reason for this 
presumably lies in the fact that meta-logical operations 
the algorithm needs, like generalise and copy_term 
have been modeled in OZ and not on the logical level 
were they properly belong, namely the constraint solver. 
10 This implementation is available from 



ltop 

anchor(x 1 , ' Speaker' ) 



x2 
man(x2) 



11 




see(el,xl,x2) 



14 




15 


x3 

hill(x3) 


<^verj! 











x4 

tele(x4) 



on(B,x3) 



with(D,x4) 



http://www.ims.uni-stuttgart.de/~jochen/CBSem 



Figure 6: Conjunctive part of UDRS, graphically 

Table [j] gives execution times used for semantics 
construction of sentences of the form I saw a man 
(on a hill)' 1 for different n. The machine used for 
the experiment was a Sun Ultra-2 (168MHz), run- 
ning Sicstus 3.0#3. In a further experiment an n-ary 
anti_unify operation was implemented, which im- 
proved execution times for the larger sentences, e.g., 
the 16 PP sentence took 750 msec. These results ap- 



SEM[top]: 



[ltop : xl, 

anchor (xl, ' Speaker' ) , 

11 : see (el, xl, x2) , 
It (12, ltop) , 

It (11, 12) , 

12 : x2, 

12 : man(x2), 
A : on(B,x3), 
It (13, ltop) , 
It (A, 15) , 

14 : x3, 

13 : every (14, 15) , 

14 : hill(x3), 
C : with (D, x4) , 
It (16, ltop) , 

It (C, 16) , 

16 : x4, 

16 : tele (x4) ] 



D[top] (a Prolog goal): 



ENV (as Prolog predicates) : 

dEnv(506, 1, A) :- 

( A=[el,ll] 
; A=[x2,12] 
) ■ 

dEnv(339, 1, A) :- 

( A=[C,B,C,B] 
; A=[x3, 14,_,_] 
) • 

dEnv(509, 2, A) :- 

( A= [el, 11, x3, 14] 
; A= [x2, 12, C, B] , 

dEnv(339, 1, [C,B,x2,12]) 

) ■ 

dEnv(509, 1, A) :- 

( A=[G,F,el, 11] , 

dEnv(506, 1, [G, F ] ) 
; A=[E,D,C,B], 

dEnv(509, 2, [E,D,C,B]) 

) ■ 



dEnv (509, 1, [B, A, D, C] ) ) 



Figure 5: Packed UDRS: conjunctive part (left column) and disjunctive binding environment 



n 


Readings 


AND- + OR-nodcs 


Time 


2 


5 


35 


4 msec 


4 


42 


91 


16 msec 


6 


429 


183 


48 msec 


8 


4862 


319 


114 msec 


10 


58786 


507 


220 msec 


12 


742900 


755 


430 msec 


14 


9694845 


1071 


730 msec 


16 


129 Mio. 


1463 


1140 msec 



Table 1: Execution times 



may still prove useful, when this restriction is not 
fulfilled, since it focuses on computing the common 
information of disjunctive branches. The conjunctive 
part of the output constraint of the algorithm can 
then be seen as an approximation of the actual re- 
sult, if the output constraint is satisfiable. Moreover, 
the disjunctive parts are reduced, so that a subse- 
quent full-fledged search will have considerably less 
work than when directly trying to solve the original 
constraint system. 



proximately fit the expectations from the theoretical 
complexity bound. 

6 Discussion 

Our algorithm and its implementation show that it 
is not only possible in theory, but also feasible in 
practice to construct packed semantical representa- 
tions directly from parse forests for sentence that ex- 
hibit massive syntactic ambiguity. The algorithm is 
both in asymptotic complexity and in real numbers 
dramatically faster than an earlier approach, that 
also tries to provide an underspecified semantics for 
syntactic ambiguities. The algorithm has been pre- 
sented abstractly from the actual constraint system 
and can be adapted to any constraint-based gram- 
mar formalism. 

A critical assumption for the method has been 
that semantic rules never fail, i.e., no search is in- 
volved in semantics construction. This is required 
to guarantee that the resulting constraint is a kind 
of 'solved form' actually representing so-to-speak the 
free combination of choices it contains. Nevertheless, 
our method (modulo small changes to handle failure) 
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